TEI and Microsoft: a marriage made in...: Using proprietary tools for producing standardly encoded editions

نویسنده

  • Tomaž Erjavec
چکیده

1. XML and editorial interventions It is generally assumed that scholarly annotated digital texts, be they text-critical editions or linguistically annotated corpora, should be stored in XML to ensure longevity as well as platform and software independence. Furthermore, the TEI Guidelines (SperbergMcQueen and Burnard, 2002) are the de-facto standard for constructing the XML schemas for most such texts. But while XML is well-suited for machine processing, it is less than ideal for authorial or editorial interventions into the text, esp. when used with the complex TEI-derived schemas. It is of course possible to edit XML documents directly in a plain text editor, or better yet in specialised XML editors that support on-the-fly validation against a schema and schema-dependent drop-down menus. But, depending on the text type and required manual interventions, these generic editors are too clumsy for extensive work, and do not enable complex constraints on the allowed content and changes in the annotations. An additional problem with using XML editors for editorial work is the fact that many humanities scholars or students who are most likely to be doing this work have no knowledge of XML or TEI or any experience in editing it. This problem becomes all the more relevant in collaborative projects in which, say, a large number of students are hired to annotate a certain text or text collection. The effort required to first teach them how to use an XML editor and the underlying concepts might be prohibitive, and saving time necessary to perform each editorial intervention is essential.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Architecture for Editing Complex Digital Documents

In several on-going projects we were faced with the dilemma of how to reconcile our goal of delivering standardly encoded digital editions, yet have the actual editing and annotation performed by researchers and students who had no knowledge of XML and the Text Encoding Initiative Guidelines (TEI), and, for the most part, no great interest in learning them. The developed solution consists of al...

متن کامل

Semantic Cartography: Using RDF/OWL to Build Adaptable Tools for Text Exploration

Texts encoded using the Text Encoding Initiative Guidelines (TEI) are ideally suited to close examination using a variety of digital methodologies and tools. However, because the TEI is a broad set of guidelines rather than a single schema, and because encoding practices and standards vary widely between collections, programmatic interchange among projects can be difficult. Software that operat...

متن کامل

Encoding models for scholarly literature

In this chapter, the authors examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of “born-digital” scholarly journal articles. This small area of electronic publishing represents a microcosm of the state of the art, and provides a good basis for this discussion. The authors will begin by looking at the traditional workflow of journa...

متن کامل

رابطه معنی زندگی و بهزیستی روان‌شناختی با تمایل به ازدواج در دانشجویان

The study aims at investigating the relationship between the cognitive constructs of those who crave for marriage and those who elude marriage and psychological well-being and meaning of life. This is a descriptive research conducted using the correlational method. The statistical sample includes 106 people either eluding or craving for marriage, who were selected by simple random sampling meth...

متن کامل

Specifying a TEI-XML Based Format for Aligning Text to Image at Character Level

This papers presents an experience of specifying and implementing an XML format for text to image alignment at word and character level within the TEI framework. The format in question is a supplementary markup layer applied to heterogeneous transcriptions of medieval Latin and French manuscripts encoded using different “flavors” of the TEI (normalized for critical editions, diplomatic or palae...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007